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1 XML: XML screamer: an integrated approach to high performance XML parsing. 



validation and deserialization 

Margaret G. Kostoulas, Morris Matsa, Noah Mendelsohn, Eric Perkins, Abraham Heifets, 
Martha Mercaldi 

May 2006 Proceedings of the 15th international conference on World Wide Web 
WWW '06 

Publisher: ACM Press 

Full text available: ^ pdf(3Q3.88 KB) Additional Information: full citation , abstract, references , index terms 

This paper describes an experimental system in which customized high performance XML 
parsers are prepared using parser generation and compilation techniques. Parsing is 
integrated with Schema-based validation and deserialization, and the resulting validating 
processors are shown to be as fast as or in many cases significantly faster than traditional 
nonvalidating parsers. High performance is achieved by integration across layers of 
software that are traditionally separate, by avoiding unnecessar ... 

Keywords: JAX-RPC, SAX, XML, XML schema, parsing, performance, schema compilation, 
validation 



Research session: XML query processing #3: Efficient evaluation of XQuerv over 

streaming data 

Xiaogang Li, Gagan Agrawal 

August 2005 Proceedings of the 31st international conference on Very large data 
bases VLDB '05 

Publisher: VLDB Endowment 



Full text available: ^ odf(302.57 KB) 



Additional Information: full citation , abstract , references , citings , index 
terms 



With the growing popularity of XML and emergence of streaming data model, processing 
queries over streaming XML has become an important topic. This paper presents a new 
framework and a set of techniques for processing XQuery over streaming data. As 
compared to the existing work on supporting XPath/XQuery over data streams, we make 
the following three contributions: 1. We propose a series of optimizations which transform 
XQuery queries so that they can be correctly executed with a single pass on t ... 

Software design, languages and systems: Reducing application code complexity with 
vocabulary-specific XML language binding s 

Jules White, Boris Kolpackov, Balachandran Natarajan, Douglas C. Schmidt 

March 2005 Proceedings of the 43rd annual southeast regional conference - Volume 

2 ACM-SE 43 
Publisher: ACM Press 
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Full text available: ^ pdf(498.52 KB) Additional Information: full citation , abstract , references , index terms 

The extensible Markup Language (XML) has become a ubiquitous data exchange and 
storage format. A variety of tools are available for incorporating XML-based data into 
applications. The most common XML tools (such as parsers for SAX and DOM) provide 
low-level vocabulary-independent interfaces, which can make it hard to develop and 
debug robust applications. This paper examines tools for generating vocabulary-specific 
XML-to-C+ + language mappings and shows how they can reduce key sources of 
complex ... 

Keywords: C++, DOM, SAX, W3C XML schema, XML, vocabulary-specific language 
binding 

4 When and how to develop domain-specific languages 9 
Marjan Mernik, Jan Heering, Anthony M. Sloane 
December 2005 ACM Computing Surveys (CSUR), volume 37 issue 4 

Publisher: ACM Press 

i- ii * a i ui 0 ,x /0 .o no isb\ Additional Information: full citation , abstract , references , citings , index 

Full text available: T!g pdf(318.02 KB ) - 

y£=jr terms 

Domain-specific languages (DSLs) are languages tailored to a specific application domain. 
They offer substantial gains in expressiveness and ease of use compared with general- 
purpose programming languages in their domain of application. DSL development is hard, 
requiring both domain knowledge and language development expertise. Few people have 
both. Not surprisingly, the decision to develop a DSL is often postponed indefinitely, if 
considered at all, and most DSLs never get beyond the applicatio ... 

Keywords: Domain-specific language, application language, domain analysis, language 
development system 

5 XML pa rsing and stylesheets: An adaptive, fast, and safe XML parser based on by te Q 
<g> sequences memorization 

^ Toshiro Takase, Hisashi MIYASHITA, Toyotaro Suzumura, Michiaki Tatsubori 

May 2005 Proceedings of the 14th international conference on World Wide Web 

WWW 05 
Publisher: ACM Press 

Full text available* fiCl odf(274 39 KB) Additional Information: full citation , abstract , references , citings . Index 
"™ terms 

XML (Extensible Markup Language) processing can incur significant runtime overhead in 
XML-based infrastructural middleware such as Web service application servers. This paper 
proposes a novel mechanism for efficiently processing similar XML documents. Given a 
new XML document as a byte sequence, the XML parser proposed in this paper normally 
avoids syntactic analysis but simply matches the document with previously processed 
ones, reusing those results. Our parser is adaptive since it partially p ... 

Keywords: SAX, XML parsers, automata 



Concrete syntax for objects: domain-specific language embedding and assimilation 

without restrictions 

Martin Bravenboer, Eelco Visser 

October 2004 ACM SIGPLAN Notices , Proceedings of the 19th annual ACM SIGPLAN 
conference on Object-oriented programming, systems, languages, and 
applications OOPSLA '04, Volume 39 Issue 10 

Publisher: ACM Press 

Full text available- fiQ Pdf(379.91 KB) Additional Information: full citation , abstract, references , otings, index 
^ terms , review 



http://portal.acm.org/resultsxfm^ 1/9/2007 



Results (page 1): xml sax code generation 



Page 3 of 7 



Application programmer's interfaces give access to domain knowledge encapsulated in 
class libraries without providing the appropriate notation for expressing domain 
composition. Since object-oriented languages are designed for extensibility and reuse, the 
language constructs are often sufficient for expressing domain abstractions at the 
semantic level. However, they do not provide the right abstractions at the syntactic level. 
In this paper we describe MetaBorg, a method for providing <i> ... 

Keywords: MetaBorg, SDF, concrete object syntax, domain-specific languages, 
embedded languages, extensible syntax, meta programming, rewriting, stratego, syntax 
extension 



7 Q focus: semi-structured data: XML <and semi-structured data> t ■ 
C. M. Sperberg-McQueen 
October 2005 Queue, Volume 3 issue 8 

Publisher: ACM Press 

Full text availableig ^^ Addjtjonal lnformatjon; fuM cita tion . abstract , references , index terms 

\9\ html(32,69 KB) 

XML, as defined by the World Wide Web Consortium in 1998, is a method of marking up a 
document or character stream to identify structural or other units within the data. XML 
makes several contributions to solving the problem of semi -structured data, the term 
database theorists use to denote data that exhibits any of the following 
characteristics: •Numerous repeating fields and structures in a naive hierarchical 
representation of the data, which lead to large numbers of tables in a sec ... 

8 Algorithms and programming models for efficient representation of XML for Internet 
applications 

Neel Sundaresan, Reshad Moussa 
April 2001 Proceedings of the 10th international conference on World Wide Web 
WWW '01 

Publisher: ACM Press 

Full text available: *{g pdf(352.97 KB) Additional Information: full citation , references , citings , index terms 

Keywords: DOM, SAX, WBXML, XML, compression 

9 Web and e-business application: A framework for automatic generation of web-based Q 

^ data entry applications based on XML 
^ Vol kerTu rau 

March 2002 Proceedings of the 2002 ACM symposium on Applied computing SAC '02 
Publisher: ACM Press 

.... . -i u. « ,x /fiOC ,7 vn\ Additional Information: full citation , abstract , references , citings , index 
Full text available: TO pdf(635.47 KB) 

ir — 1 terms 

This paper presents a framework for web-based data entry applications. It introduces a 
method for the conceptional and the navigational design based on a textual specification 
in the form of an XML-application. This forms the input to a code generation environment 
allowing for real automated prototyping. The environment produces fully functional 
skeletons for the web pages. Together with the framework classes they can be utilized for 
testing and for requirements review. They also form the starti ... 

Keywords: automated prototyping, frameworks, web-based data entry 

10 Software and document engineering: Supporting document and data views of source Qjj 
code 

^ Michael L. Collard, Jonathan I. Maletic, Andrian Marcus 
http://portal.acm.org/results.cfa^ 1/9/2007 
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November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
DocEng '02 

Publisher: ACM Press 

Full text available:- g) P df(162.30KB^ Additional ,nf ° rmation: JJ^ 1 ' references ,cjtings, index 

The paper describes the use of an XML format to store and represent program source 
code. A new XML application, srcML (SouRCe Markup Language), is presented. srcML 
presumes a document view of source code where information about the syntactic 
structure is layered over the original source code document. The resultant multi-layered 
document has a base layer of all the original text (and formatting). The second layer is 
the syntactic information, derived from the grammar of the programming language, ... 

Keywords: XML, abstract syntax tree, markup language, program analysis, source code 



11 Utilizin g traceabilitv links: An XML based a pproach to s u pport the evolution of model- Q 
to-model traceability links 

Jonathan I. Maletic, Michael L. Collard, Bonita Simoes 

November 2005 Proceedings of the 3rd international workshop on Traceability in 
emerging forms of software engineering TEFSE '05 

Publisher: ACM Press 

Full text available: Qpdf(135.84 KB) Additional Information: full citation , abstract , references , index terms 

The paper summarizes the authors' current research on supporting model-to-model 
traceability. The authors present a graph theoretic definition of what they mean by 
models and traceability links. This definition is realized by the use of XML technologies to 
represent the models and traceability links. Practical means to represent different types of 
system models (e.g., source code and design) using XML are discussed. Traceability links 
are also implemented using XML technologies in an efficient a ... 




Keywords: maintenance, software evolution, traceability 



12 XML processing: A comprehensive solution to the XML-to-relational mapping 
problem 

Sihem Amer-Yahia, Fang Du, Juliana Freire 

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management WIDM '04 

Publisher: ACM Press 

Full text available ^ pdffl 14.33 KB) Additional lnformatjon: Mdtatjgn, abstract, references, otings, index 
' ; terms 

The use of relational database management systems (RDBMSs) to store and query XML 
data has attracted considerable interest with a view to leveraging their powerful and 
reliable data management services. Due to the mismatch between the relational and XML 
data models, it is necessary to first shred and load the XML data into relational tables, 
and then btranslate XML queries over the original data into equivalent SQL queries over 
the mapped tables. Although there is a rich literature on XML-rel ... 

Keywords: XML shredding, XML storage, mapping techniques, relational databases 



13 Web-enabled simulation technologies: web-enabled applications: Flexible integration 
of XML into modeling and simulation systems 
Mathias Rohl, Adelinde M. Uhrmacher 

December 2005 Proceedings of the 37th conference on Winter simulation WSC '05 
Publisher: Winter Simulation Conference 

Full text available: Q pdf(421.56 KB) Additional Information: full citation , abstract , references 

As the effort towards standardization of formalism representations increases so does the 
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need for verifying whether models do or do not follow a standard. Data binding allows to 
systematically exploit XML and its associated technologies for modeling and simulation 
purposes. Based on the schema definition of a formalism, a binding compiler generates 
model classes that support the user in constructing models according to the formalism. 
Most constraints can be checked automatically, few require sep ... 

14 Using Javadoc and XML to produce API reference documentation j| 
Donald M. Leslie 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation SIGDOC '02 

Publisher: ACM Press 

Full text available: « odf(623.29 KB) Additi ° nal Information: full citation , abstract references , citings, index 
^ terms 

The creation of API reference documentation and its integration into larger documentation 
sets present a number of challenges. This paper explores a strategy for using Javadoc, 
the primary mechanism for generating Java API documentation, in conjunction with an 
XML infrastructure, to improve the quality of API reference material and its integration 
into product documentation sets. 

Keywords: API documentation, HTML, Javadoc, XML, XSL stylesheets, XSLT, doclets, 
document transformation 



15 Models and tools for generating digital libraries: SSL: a language for declarative 
^ specification and generation of digital libraries 
Marcos Andre, Gongalves, Edward A. Fox 

July 2002 Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries 
JCDL '02 

Publisher: ACM Press 

Full text available* 151 pdf(2 92 78 KB) Ad^' 003 ' Information: full citation , abstract , references, citings , index 
' T^J" : terms 

Digital libraries (DLs) are among the most complex kinds of information systems, due in 
part to their intrinsic multi disciplinary nature. Nowadays DLs are built within monolithic, 
tightly integrated, and generally inflexible systems — or by assembling disparate 
components together in an ad-hoc way, with resulting problems in interoperability and 
adaptability. More importantly, conceptual modeling, requirements analysis, and software 
engineering approaches are rarely supported, making it extrem ... 

Keywords: 5S, 5SL, design tools, generators, languages, models 



16 Techniques for document management and document engineering: A programming Q 
^ environment for demand-driven processing of network XML data and its performance 
evaluation 

Masakazu Yamanaka, Kenji Niimura, Tomio Kamada 

November 2005 Proceedings of the 2005 ACM symposium on Document engineering 
DocEng '05 

Publisher: ACM Press 

Full text available: ^] pdf(417.15 KB) Additional Information: full citation , abstract , references , index terms 

This paper proposes a programming environment for Java that processes network XML 
data in a demand-driven manner to return quick initial responses. Our system provides a 
data binding tool and a tree operation package, and the programmer can easily handle ^ 
network XML data as tree-based operations using these facilities. For efficiency, demand- 
driven data binding allows the application to start the processing of a network XML 
document before the arrival of the whole data, and our tree o ... 

Keywords: XML, data binding, demand-driven, multi-threading 
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17 Modeling methodology b: Open source initiatives for simulation software: next 
generation simulation environments founded on open source software and XML- 
based standard interfaces 
Thomas Wiedemann 

December 2002 Proceedings of the 34th conference on Winter simulation: exploring 

new frontiers WSC '02 
Publisher: Winter Simulation Conference 

Full text available: g pdf(181.61 KB) Additional Information: full citation , abstract , references 

During the Winter Simulation Conference 2001 the OpenSML-project was presented and 
started. The OpenSML-project is based on the Simulation Modeling Language 
(SML<sup>TM</sup>) and is an open source, web-based, multi-language simulation 
development project guided by a consortium of industrial, academic and government 
simulation consultants, practitioners and developers. For the simulation community, the 
open source movement represents an opportunity to improve the quality of common 
cor ... 

18 An access control method based on the prefix labelin g scheme for XML repositories 9 
Shohei Yokoyama, Manabu Ohta, Kaoru Katayama, Hiroshi Ishikawa 

January 2005 Proceedings of the 16th Australasian database conference - Volume 39 
ADC 05 

Publisher: Australian Computer Society, Inc. 

Full text available: ^ pdf(462.06 KB) Additional Information: full citation , abstract , references , index terms 

This paper describes an access control method of the XML repository system, 
SAXOPHONE, which was implemented at Tokyo Metropolitan University. The main feature 
of our research is a novel account identifier that is based on the prefix-labeling scheme to 
realize a hierarchical authorization. SAXOPHONE uses relational databases for XML 
document storage. Using it, any valid or well-formed XML document is decomposed into 
events of the SAX parser and is then stored into relational tables using a fixed ... 

Keywords: SAX parser, XML database, XML repository, access control, relational 
database 




19 Book reviews: Review of "XML and Java: developing web applications, 2nd edition by §§| 
^ Hiroshi Maruyama. et. al." Addison-Wesley 2002. 
^ Brian A. Lawler 

March 2003 ACM SIGSOFT Software Engineering Notes, volume 28 issue 2 

Publisher: ACM Press 

Full text available: ^pdfd 34.41 KB) Additional Information: full citation 



20 Optimizing encoding: An evaluation of binary xml encoding optimizations for fast 
stream based xml processing 
R. J. Bayardo, D. Gruhl, V. Josifovski, J. Myllymaki 

May 2004 Proceedings of the 13th international conference on World Wide Web 
WWW '04 

Publisher: ACM Press 

Full text available- 1?) pdf(255 72 KB) Addition 31 Information: full citation , abstract , references , citing s, index 

terms 

This paper provides an objective evaluation of the performance impacts of binary XML 
encodings, using a fast stream-based XQuery processor as our representative application. 
Instead of proposing one binary format and comparing it against standard XML parsers, 
we investigate the individual effects of several binary encoding techniques that are shared 
by many proposals. Our goal is to provide a deeper understanding of the performance 
impacts of binary XML encodings in order to clarify the ongoing ... 
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